Gene selection for the reconstruction of stem cell differentiation trees: a linear programming approach

نویسندگان

  • Mohamed A. Ghadie
  • Nathalie Japkowicz
  • Theodore J. Perkins
چکیده

MOTIVATION Stem cell differentiation is largely guided by master transcriptional regulators, but it also depends on the expression of other types of genes, such as cell cycle genes, signaling genes, metabolic genes, trafficking genes, etc. Traditional approaches to understanding gene expression patterns across multiple conditions, such as principal components analysis or K-means clustering, can group cell types based on gene expression, but they do so without knowledge of the differentiation hierarchy. Hierarchical clustering can organize cell types into a tree, but in general this tree is different from the differentiation hierarchy itself. METHODS Given the differentiation hierarchy and gene expression data at each node, we construct a weighted Euclidean distance metric such that the minimum spanning tree with respect to that metric is precisely the given differentiation hierarchy. We provide a set of linear constraints that are provably sufficient for the desired construction and a linear programming approach to identify sparse sets of weights, effectively identifying genes that are most relevant for discriminating different parts of the tree. RESULTS We apply our method to microarray gene expression data describing 38 cell types in the hematopoiesis hierarchy, constructing a weighted Euclidean metric that uses just 175 genes. However, we find that there are many alternative sets of weights that satisfy the linear constraints. Thus, in the style of random-forest training, we also construct metrics based on random subsets of the genes and compare them to the metric of 175 genes. We then report on the selected genes and their biological functions. Our approach offers a new way to identify genes that may have important roles in stem cell differentiation. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bio-printing Damaged Tissues: A Novel Approach in Regenerative Medicine

Regenerative medicine, deals with functional reconstruction of damaged tissues or organs after severe injuries chronic diseases, while body's natural responses are not sufficient. In this field, stem cells due to their exclusive potential in self-renewal and differentiation into other cell types, are the main sources of functional cells in regenerative medicine. However, challenges in stem cell...

متن کامل

Target setting in the process of merging and restructuring of decision-making units using multiple objective linear programming

This paper presents a novel approach to achieving the goals of data envelopment analysis in the process of reconstruction and integration of decision-making units by using multiple objective linear programming. In this regard, first, we review inverse data envelopment analysis models for data reconstruction and integration. We present a model with multi-objective linear programming structure in...

متن کامل

Cardiogel as an Instructive Microenvironment for in vitro Differentiation of Bone Marrow- Derived Mesenchymal Stem Cells into Cardiomyocytes

Background: Stem cell therapy has been developed as an effective treatment method for the heart failure. Also, extracellular matrix has shown the positive effects in stem cell differentiation and myocardial tissue organization. Cardiogel is a native cardiac extracellular matrix (ECM) derived from cardiac fibroblasts. In the present study the role of cardiogel is examin...

متن کامل

An Integrated Decision Making Model for Manufacturing Cell Formation and Supplier Selection

Optimization of the complete manufacturing and supply process has become a critical ingredient for gaining a competitive advantage. This article provides a unified mathematical framework for modeling manufacturing cell configuration and raw material supplier selection in a two-level supply chain network. The commonly used manufacturing design parameters along with supplier selection and a subco...

متن کامل

Gene Expression and Promoter Methylation Status of VHL, Runx-3, E-cadherin, P15 and P16 Genes During EPO-Mediated Erythroid Differentiation of CD34+ Hematopoietic Stem Cells

Background: VHL (von Hippel-Lindau), Runx-3 (Runt-related transcription factor 3), E-cadherin (Epithelial cadherin), P15 (INK4a, cyclin dependent kinase inhibitor), and P16 (INK4b) genes are essential in hematopoiesis. The aim of this study was to explore the correlation between gene expression and promoter methylation in CD34+ stem cells before and after differentiation to erythroid lineage. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 31 16  شماره 

صفحات  -

تاریخ انتشار 2015